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CCL20/MIP-3tt is a /3-ehemokine expressed in the thymus, 
skin, and intestinal epithelial cells that exclusively binds 
and activates the CCR6 receptor in both mice and humans. 
The strict receptor binding specificity of CCL20 is excep- 
tional; other chemokines and their receptors bind promis- 
cuously with multiple partners. Toward determining the 
structural basis for the selective receptor specificity of 
CCL20, we have determined its three-dimensional struc- 
ture by NMR spectroscopy. CCL20 exhibits the same 
monomeric structure previously described for other che- 
mokines: a three-stranded /3-sheet and an overlying a-helix. 
The CCL20 receptor selectivity could arise from the rigid 
conformation of the N-terminal DCCL motif as well as the 
groove between the N-loop and the /3 2 -/3 3 hairpin, which is 
significantly narrower in CCL20 than in other chemokines. 
Similar structural features are seen in human /3-defensin 2, 
a small nonchemokine polypeptide reported to selectively 
bind and activate CCR6, which stresses their importance 
for the specific binding of both CCL20 and /3-defensin 2 to 
CCR6. CCL20’s structure will be useful to design tools 
aimed to modulate its important biological functions. 


Chemokines are a large family of nearly 50 small chemotac- 
tic cytokines that regulate the trafficking of leukocytes and the 
host inflammatory responses by specifically interacting with G 
protein -coupled receptors (1—4). Based on conserved cysteine 
residues in the N terminus of the molecules, CXC, CC, C, and 
CX3C chemokine subfamilies have been described (1, 4). The 
hallmark of chemokine function is considered to be the regula- 
tion of leukocyte trafficking, but chemokines also play impor- 
tant roles in the control of angiogenesis, tumor metastasis, 
hematopoiesis, and human immunodeficiency virus infection 
(1-5). These properties make chemokines and their receptors 
very interesting targets for therapeutic intervention. 
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Eighteen different chemokine receptors have been described so 
far (6). Their interactions with their ligands are clearly charac- 
terized by promiscuity, since most chemokine receptors are able 
to bind several chemokines and many chemokines can interact 
with several receptors. CCL20/MIP-3a, a CC chemokine, is an 
exception to this general rule. In both humans and mice, CCL20 
binds only to CCR6 receptor (7, 8), which is not bound or acti- 
vated by any other chemokine. Besides its strict receptor binding 
specificity, CCL20 has an interesting expression pattern. Murine 
CCL20 is expressed in the thymus (8), epithelial cells from intes- 
tinal tissue (9), and the follicle-associated epithelium in Peyer’s 
patches (10). In addition, data reported on human and mouse 
CCR6 show the specific expression of this receptor in different 
dendritic cell and lymphocyte subsets (8, 10-13). 

Taken together, these data suggest a role for CCR6/CCL20 
signaling in regulating the migration and recruitment of anti- 
gen-presenting and immunocompetent cells during inflamma- 
tory and immunological responses. Indeed, two recent studies 
on CCR6 -/ ~ mice showed that these animals have defects in 
leukocyte homing to the intestinal mucosa, impaired humoral 
immunity, and altered responses in contact hypersensitivity 
and delayed type hypersensitivity models of inflammation (14, 
15). Strikingly, a recent study showed that human /3-defensins, 
small polypeptides that contribute to host defense and whose 
overall amino sequence shows no homology to CCL20, are able 
to recruit dendritic and T cells by interacting only with CCR6, 
out of several different chemokine receptors tested (16). 

Detailed structural studies on different chemokines have 
been performed to help to identify the motifs that are key for 
the specificity of receptor binding and activation (17-36). These 
studies have been carried out on promiscuous chemokines with 
different degrees of receptor binding specificity, whose recep- 
tors are also able to bind some other chemokine(s). As an 
approach to study the basis of the strict binding specificity of 
CCL20 to CCR6, we have investigated the conformation of 
CCL20 by two-dimensional X H NMR spectroscopy. Here, we 
describe the complete sequential assignment and the tertiary 
structure in aqueous solution and propose preliminary quater- 
nary structures relevant to CCL20 function. In addition, the 
CCL20 structure is compared with those of other chemokines 
and with the /3-defensin 2 (BD2) 1 structure (37). On the basis of 


1 The abbreviations used are: BD1 and BD2, /3-defensin 1 and 2, 
respectively; HPLC, high pressure liquid chromatography; NOE, nu- 
clear Overhauser effect; NOESY, NOE spectroscopy; r.m.s., root mean 
square; RANTES, regulated on activation normal T cell expressed and 
secreted; ASA, accessible surface area. 
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these results, we have identified some of the CCL20 structural 
motifs that could mediate its specific binding to CCR6. Also, we 
described structural features in BD2 that could explain its 
binding to CCR6. 

EXPERIMENTAL PROCEDURES 

Chemical Synthesis of CCL20 — Mouse CCL20 (8) (ASNYDCCLSY 
IQTPLPSRAI VGFTRQMADE ACDINAIIFH TKKRKSVCAD PKQN- 
WVKRAV NLLSLRVKKM) was synthesized on a fully automatic pep- 
tide synthesizer (Applied Biosystems 430) by Fmoc (IV-(9-fluorenyl)me- 
thoxycarbonyl) chemistry, according to standard protocols (38). After 
synthesis, protecting side chain groups were removed, and the peptide 
was cleaved from the resin following the method of King et al . (39). 
Folding and disulfide bridge formation of the HPLC product was per- 
formed as described elsewhere (40). The purity and composition of the 
folded polypeptide was confirmed by HPLC on a C-18 Nucleosil 120 
analytical column (Tracer), by amino acid analysis using a Beckman 
6300 amino acid analyzer and by mass spectrometry using a matrix- 
assisted laser desorption and ionization time-of-flight mass spectrome- 
ter, model Reflex III (Bruker-Franzen Analytic GmbH, Bremen, Ger- 
many) equipped with the SCOUT™ source in positive ion reflector 
mode using delayed extraction and AutoXecute™ acquisition software. 
Ion acceleration voltage was 25 kV, The equipment was externally 
calibrated employing protonated mass signals from a peptide mixture 
covering the 1000-12,300 m/z range. The experimental molecular mass 
obtained was 7952.6 daltons (calculated molecular mass, 7948 Da). 

Sedimentation Equilibrium — Sedimentation equilibrium experi- 
ments were performed in a Beckman Optima XL- A ultracentrifuge 
using a Ti60 rotor and double sector centerpieces of Epon-Charcoal 
(optical pathlength 12 or 4 mm, depending upon peptide concentration). 
Samples of CCL20 in the concentration range from 0.15 to 0.02 mM and 
equilibrated in water with 200 mM NaCl, at pH 3.5 and 5.5, were 
centrifuged at 33,000 rpm and 20 °C. Radial scans at different wave- 
lengths (from 280 to 300 nm) were taken every 2 h until equilibrium 
was reached, as judged by there being no difference between two suc- 
cessive scans. The weight average molecular masses of CCL20 were 
determined by using the program EQ ASSOC (Beckman) (41) with the 
partial specific volume calculated from its amino acid composition (42). 

NMR Samples — NMR experiments were performed employing a ~1 
mM sample at 25, 35, and 45 °C, pH 3.5 and 5.5 (uncorrected for 
deuterium isotope effects), containing 200 mM NaCl in either H 2 0/D 2 0 
(90%/10%, v/v) or D 2 0. Sodium trimethylsilyl [2, 2, 3, 3- 2 HJ propionate 
was used as the internal chemical shift reference. 

NMR Spectroscopy — All spectra were acquired in the phase-sensitive 
mode with time-proportional phase incrementation mode (43) on a 
Bruker AMX-600 spectrometer equipped with a triple resonance probe 
and pulsed field gradients. Pulse gradients were used to suppress the 
water signal and undesired coherence pathways. *H homonuclear 
COSY (44), TOCSY with a mixing time of 60 ms (45), and NOESY (46) 
spectra with 80- and 150-ms mixing times were recorded by standard 
methods with water suppression achieved by selective presaturation of 
the water signal or by including the WATERGATE module (47) in the 
original pulse sequences. The size of the acquisition data matrix was 
2048 X 512 words in f z and f lt respectively. Before Fourier transforma- 
tion, the two-dimensional data matrix was multiplied by a phase- 
shifted sine bell or square sine bell window function in both dimensions. 
The corresponding shift was optimized in every experiment. Base-line 
correction was applied in both dimensions. All spectra were processed 
and analyzed using XWINNMR (Bruker, Analytic Messtechnik GmbH, 
Ettlingen, Germany) and ANSIG (48) software on an IRIS Indigo work 
station. 

Restraint Assignment — NMR resonances were assigned using 
standard sequential assignment procedures (49). Spin systems were 
identified by analysis and comparison of the COSY and TOCSY spectra. 
The through-space connectivities were then determined using the 
NOESY spectra. The use of three temperatures and two pH values to 
record the spectra (25, 35, and 45 °C at pH 3.5 and 25 and 35 °C at pH 
5.5) allowed us to resolve ambiguities that arose from chemical shift 
degeneracies. 

The connectivities from NOESY spectra recorded at pH 3.5 were used 
as input for the structure calculation. After integration, the peak vol- 
umes and chemical shift lists were output to a DYANA-compatible 
format (50). For each NOESY spectrum, the automatic calibration 
method implemented in DYANA was run to transform the peak vol- 
umes into distances. The upper limits obtained were used without 
modification for the initial structure calculations. The assignment of 
cross-peaks in the NOESY spectra was carried out in several steps. A 
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preliminary NOE data set was used to calculate initial structures with 
DYANA. The resulting conformers with lowest energies were used to 
assign additional peaks in the NOESY spectra. The cycle of calculations 
and assignments was repeated until no further assignments were pos- 
sible. The quality of the restrictions was checked by analyzing the 
restraint violations of the calculated conformers. The NOE cross-peaks 
corresponding to restrictions that were consistently violated in a sig- 
nificant number of structures were checked for possible overlap, and the 
corresponding restraints were consequently modified. This cycle was 
repeated until no consistent violation was detected. 

All structure calculations included the disulfide bond restraints 
Cys 6 -Cys 32 and Cys 7 -Cys 48 previously determined for CCL20, and in 
each case, these were fixed to a distance of 2.02 i: 0.1 A. Slowly 
exchanging amide protons were identified in a series of TOCSY spectra 
recorded after dissolving fully protonated protein in D 2 0. Hydrogen 
bond restraints were included in the later rounds of the structure 
calculation when the amide proton signal persisted in the D 2 0 spec- 
trum and if an unambiguous partner was present in at least one-third 
of the preliminary structures. Each hydrogen bond was specified by two 
distance restraints (HN-O distance of 1.7-2. 3 A and N-O distance of 
2.4— 3.3 A). In the initial stages of the calculations, <f) angles were 
restrained to the range 0-180°, except for Gly and Asn residues. These 
restrictions were removed in the last steps of the calculation. 

Structure Calculations and Analysis — The three-dimensional struc- 
ture of CCL20 was determined using the program DYANA (50). Accord- 
ing to the standard procedure, default weighting factors were attrib- 
uted to the different restraint categories for different types of protons. 
Pseudoatom corrections were added to those upper limit distances 
involving stereospecific atoms that were not stereospecifically assigned. 
All peptide bonds were fixed as trans. All histidine, arginine, and lysine 
residues were regarded as positively charged, while the glutamate and 
aspartate side chains were treated as negatively charged. We have used 
a simulated annealing protocol in which 60 randomly generated struc- 
tures were cooled down during 5000 torsion angle dynamics cycles to a 
final temperature of 0 K. Then a second simulated annealing step of 800 
torsion angle dynamics cycles was done at low temperature with the 
minimized structures. 

Following the torsion angle dynamics calculation, the 20 conformers 
with the lowest target function values were subjected to restrained 
energy minimization using the GROMOS force field (51). The resulting 
20 energy minimized conformers represent the solution structure of 
CCL20. 

The program MOLMOL (52) was used to visualize and globally 
characterize the structures. All r.m.s. deviation values reported here 
are mean pairwise r.m.s. deviation. Global r.m.s. deviation values are 
obtained from the best superposition of all considered residues. 

RESULTS 

CCL20 Is in an Association Equilibrium — Previously, sev- 
eral chemokines have been found to form dimers or higher 
order oligomers (53). To determine whether CCL20 oligomer- 
izes in conditions similar to those used for NMR measure- 
ments, equilibrium sedimentation studies were performed. The 
determined mass of CCL20 in 200 mM NaCl at 20 °C, pH 5.5, 
and a variety of concentrations, shows a wide molecular mass 
distribution between 8.2 and 11.7 kDa (monomers’ calculated 
molecular mass, 7948 Da). Similar results were found in the 
solution studies of CCLll/eotaxin (32), which showed that low 
pH values favor the monomeric species in the equilibrium. To 
test if this was also the case for CCL20, its molecular mass was 
also determined for samples at pH 3.5, and the result was 
similar to that mentioned above for pH 5.5. The equilibrium 
was also visible in the NMR spectra under several different 
conditions. At pH 5.5 and 25—35 °C, several chemical exchange 
peaks were clearly visible on TOCSY and NOESY spectra, 
confirming the existence of slow exchange (on the NMR time 
scale) of nuclei between different environments. The kinetics of 
the equilibrium is, however, pH-dependent, and is more rapid 
at lower pH values. Consequently, at pH 3.5, only a single set 
of signals, corresponding to the average of the populations 
present in solution, is observed in the NMR spectra. 

Assignments — Resonance assignments were achieved at pH 
3.5 and pH 5.5 using standard two-dimensional sequential 
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assignment techniques. At pH 5.5, the assignment process was 
hindered by the presence of two sets of signals and significant 
line broadening of some specific resonances. Consequently, sig- 
nals corresponding to the amide protons of Phe 23 , Arg 25 , Gin 26 , 
Met 27 , Arg 44 , Lys 45 , Lys 57 , Leu 65 , Arg 66 , Lys 69 , and Met 70 were 
not observable at this pH. In addition, exchange cross-peaks 
were identified for some resonances (e.g. Phe 23 H a (5.84 and 
5.50 ppm) and Thr 41 H a (5.71 and 5.58 ppm)). These data 
indicate that these residues lie in two different magnetic envi- 
ronments and suggest, as observed for other chemokines (32), 
that these regions may be involved in forming quaternary 
contacts. At pH 3.5, all spin systems could be identified, and 
sequential cross-correlations were established on the basis of 
H a -HN (H a -H s for Pro), H^-HN, and/or HN-HN NOEs observed 
between adjacent residues of the protein. For all residues, at 
least one sequential NOE could be unambiguously assigned 
with the exception of Leu 62 -Leu 63 due to signal overlap. The 2 H 
assignments, with the exception of rapidly exchanging protons, 
are almost complete. These assignments include all of the 
protons of the backbone and side chains excluding one H y in 
Pro 51 , due to overlap, and the Met 70 methyl group. Met 70 is the 
C-terminal residue, and the high flexibility typical of such 
residues probably explains why its methyl group is not ob- 
served. The 1 H chemical shifts of murine CCL20 chemokine 
(pH 3.5, 35 °C) have been deposited in the BioMagResBank 
under accession number 2465. 

The intensities of the main chain NOEs, the observed short 
and long range NOEs, and the conformational chemical shift of 
the 1 H a resonances indicate that the secondary structure of 
mouse CCL20 in solution consists of a single turn of helix, three 
/3-strands, and a C-terminal a-helix, as has been observed in 
previous chemokine structures (17, 18). A total of 20 amide 
protons show protection after 15 min of deuterium exchange. 
The location of the protected amide protons shows broad agree- 
ment with the limits of secondary structure determined by the 
NOE data. This is similar to the hydrogen exchange pattern of 
other chemokines (54). The chemical shift index (55) and a 
schematic representation of the antiparallel /3- sheet as deter- 
mined from the NOE connectivities are shown in Fig. 1. 

Calculation of the Solution Structure of Monomeric CCL20 — 
The detailed analysis of the NOESY spectra of CCL20 at pH 
3.5, at different protein concentrations and temperatures, did 
not reveal NOEs that could be attributed to intermonomer 
contacts, not even from residues at the N terminus or in the 
first /3- strand. Accordingly, all of the assigned NOESY cross- 
peaks have been used in this work to build a monomeric model 
for CCL20 compatible with the experimental data. The input 
restraints used for the final round of structure calculations are 
listed in Table I. A total 1349 NOEs were collected and trans- 
lated into 1004 relevant distance restraints, and 40 hydrogen 
bond restraints (two for each hydrogen bond; see “Experimen- 
tal Procedures”) were derived from the slow proton/deuterium 
exchange rates. The average number of restrictions per residue 
was 14.9 considering all residues and 17.1 for the well ordered 
residues (positions 6-66). As expected for a globular protein, 
the restrictions are well distributed throughout the structure 
(Fig. 2). The NMR structure of CCL20 is represented by 20 
conformers with an average pairwise r.m.s. deviation of 1.70 
and 2.31 A for the backbone and all heavy atoms, respectively 
(Table II). When the N- and C-terminal ends (positions 1-5 and 
67-70, respectively) are excluded from the comparison, the 
average pairwise r.m.s. deviation drops to 0.53 A for the back- 
bone and 1.36 A for all heavy atoms. As is generally observed, 
the elements of secondary structure are the best defined re- 
gions of the protein. The loops connecting the different ele- 
ments are also relatively well defined, and only the longest loop 



Fig. 1 . Top , chemical shift index {CSV (55) of CCL20 based on the 
chemical shifts at 35 °C and pH 3.5. Regions in /3-strand or oc~ helix are 
depicted by arrows and a cylinder, respectively. Bottom, schematic 
representation of the antiparallel /3-sheet of CCL20 as determined from 
the NOE connectivities. Observed interstrand and sequential HN-HN 
NOEs are indicated by solid arrows. Slow exchanging amide protons 
are boxed. 


Table I 

Summary of input restraints 


Restraints 

Number 

Distance restraints 

1004 

Intraresidue (t — j = 0) 

200 

Sequential (|i - j\ = 1) 

239 

Short range (|i - j\ < 5) 

171 

Long range (|i -j\ > 5) 

394 

Hydrogen bonds 

20 X 2 

Total restraints 

1044 

Average restraints per residue (1-70) 

14.9 

Average restraints per residue (6-66) 

17.1 


(positions 27—34) exhibits residues with r.m.s. deviation values 
slightly greater than 1.0 A (Fig. 2). When this region is ex- 
cluded from the comparison, r.m.s. deviation values decrease 
but are not substantially lower than those reported for residues 
6-66. The quality of the final family of 20 structures is good, 
with no average NOE violations greater than 0.4 A Most of the 
backbone torsion angles fall in the most favored regions of 
Ramachandran space. The backbone conformation of the 20 
structures are shown superimposed in Fig. 3. Atomic coordi- 
nates for the lowest energy 20 converged structures have been 
deposited in the Protein Data Bank with accession code lha6. 

Description of the Structure — The structure of CCL20 has the 
general features previously determined for other chemokines. 
The N-terminal region (residues 1-4) preceding the CC motif is 
largely disordered with r.m.s. deviation values of 8.4 -1.8 A 
(Fig. 2). However, there are long distance NOEs between side 
chain protons of Asn 3 and Tyr 4 and those of Asp 33 and lie 34 , 
indicating that these N-terminal residues are in close proxim- 
ity to the protein core a significant fraction of the time. The 
N-loop region is bent in the CC motif (residues 6-7) and then 
extends to a turn of 3 10 helix (residues 17-19). This 3 10 helix, 
which is well defined in all of the structures, contains hydrogen 
bonds linking the Ala 19 and lie 20 amide protons to the carbonyl 
groups of Pro 16 and Ser 17 , respectively. This turn motif is also 
present in other chemokine structures (36). The helical turn is 
followed by the first /3-strand (residues 22-26) of a three- 
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Fig. 2. Top, number of NOB constraints per residue used in the 
CCL20 structure calculation. NOEs between residues i and j are cate- 
gorized as intraresidue (t = j, black bar), sequential (li — j\ = 1, dark 
gray bar), medium range (1 < \i — j\ = 4, light gray bar), and long range 
(li — j\ < 4, white bar). Middle, r.m.s. pairwise deviation (A) of backbone 
atoms between 20 structures of CCL20. Bottom , mean accessible sur- 
face area (%) per residue over the 20 solution structures of CCL20. 

Table II 


Summary of NMR structure calculations 


Residual 

constraint 

violations 

Sum 

Maximum 

Upper limits 

9.5 

A 

0.4 

Energy terms 

Average 

Range 

Total energy 

-1990 

kJ • mol -1 

-1408 to -2414 

Lennard-Jones energy 

-1347 

-860 to -1678 

NOE term 

32 

20-^0 

Pairwise r.m.s. deviation 

Backbone 

All heavy atoms 

All residues (1-70) 

1.70 ± 0.56 

A 

2.31 ± 0.42 

Residues 6-66 

0.53 ± 0.12 

1.36 ± 0.13 

Residues 6-26, 35— 66 

0.48 ± 0.10 

1.31 ± 0.17 


stranded antiparallel /3-sheet. Residues 36-40 and 47-49 com- 
prise the second and third /3-strand, respectively. As mentioned 
before, the loop connecting strands 1 and 2 is not as well 
defined as the rest of the molecule, whereas strands 2 and 3 are 
linked by ordered residues. Finally, strand 3 is connected by a 
turn to a regular a-helix extending from residue 54 to 66. The 
C-terminal residues show no long range NOE with the rest of 
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Fig. 3. Stereoscopic views of the CCL20 solution structure. 
Superposition of the 20 structures with the N terminus on the right and 
the C terminus on the left side and a-helix face (top) and N-loop face 
(middle). Bottom , ribbon representation of the lowest energy CCL20 
structure in the N-loop face orientation. 


the protein and are highly disordered, with large values of 
r.m.s. deviation, 1 . 5-4.6 A. 

The a-helix is packed roughly orthogonal against the /3-sheet 
axis. A significant number of hydrophobic interactions are 
formed between the sheet (Phe 23 , Phe 39 , and Val 47 ) and one 
side of the amphipathic a-helix (Trp 55 , Val 56 , Val 60 , and Leu 63 ). 
In addition, residues lie 11 , Pro 14 , and lie 15 in the N-terminal 
loop and lie 20 following the 3 10 helical turn also make nonpolar 
contacts with each other and with Phe 39 , Pro 51 , and Trp 55 in 
the core. The residues forming this pattern of hydrophobic 
contacts in CCL20 are well conserved throughout the entire 
chemokine family and have been postulated to be responsible 
for the structural similarities between the monomeric folds of 
CXC and CC chemokine subfamilies (56). Fig. 4 shows some 
details of the structure, the amphipathic a-helix, and part of 
the hydrophobic core. The a-helix is depicted with the hydro- 
phobic face oriented toward the sheet, on the left, and the 
solvent-exposed hydrophilic face on the right. 

The N-terminal region of the structure consists of a long loop 
that packs against the face of the /3-sheet not covered by the 
a-helix. Disulfide bonds and two intramolecular hydrogen 
bonds, HN Ile 1:L -0 Cys 48 and HN Asp 50 -OH Tyr 10 , not specif- 
ically included in the calculations, link this loop to the /3-sheet. 
The interface between the N-terminal region and the /3-sheet 
appears to be further stabilized by hydrophobic contacts be- 
tween the sidechains of Tyr 4 , Tyr 10 , Gin 26 , lie 34 , and Ala 49 . 
These contacts are mainly responsible for the conformation and 
the orientation of the N terminus of the protein, which pro- 
trudes from the structure in the opposite direction from the C 
terminus. Finally, the 3 10 turn is attached to the /3-sheet by 
means of a hydrogen bond, which was not explicitly included in 
the calculation, between the carbonyl oxygen of Ala 19 and the 
amide proton of Lys 42 . 

An analysis of the accessible surface (Fig. 2) shows that, in 
general, residues exhibiting low values of accessible surface 
area (ASA) are located in structural elements, participate in 
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Fig. 4. Stereoscopic view of the amphipatic C-terminal a-helix 
and the hydrophobic core of the solution structure of CCL20. 

Top , the a-helix is represented by the best superposition of the back- 
bone atoms of the Asn 64 — Arg 66 sequence. The left side of the helix is 
hydrophobic, and the right side of the helix is hydrophilic. Bottom , part 
of the hydrophobic core composed by side chains of residues belonging to 
the a-helix (Trp 55 , Val 60 , Leu 63 ), the /3-sheet (Phe 23 , Phe 39 , Val 47 ), the 
N-loop (Pro 14 , Tyr 10 ), and Pro 61 . 

the internal hydrophobic core, and show a well defined confor- 
mation for the entire side chain. In contrast, charged and polar 
residues show larger values of ASA and are less well defined. 
Exceptions to these general rules are the hydrophobic Tyr 4 , 
Leu 8 , Val 21 , Leu 62 , and Leu 65 , which are quite exposed (ASA, 
40-60%), and well ordered. 

DISCUSSION 

Comparison with CCL7 Global Fold — Several CC and CXC 
chemokine structures have been solved by NMR and x-ray 
crystallography in recent years (17-36). The available data 
clearly indicate that the monomeric structure of all chemokines 
is very similar but that they show a variety of association 
states. Monomers share a common topology; all contain a 3 10 
helix followed by a three-stranded antiparallel /3-sheet and a 
C-terminal a-helix. In particular, structural comparison of 
CCL20 with other CC chemokines produces superposition of 
the secondary elements with low r.m.s. deviation values indic- 
ative of structural similarity: CCL5/RANTES (26), 1.8 A; 
CCL4/MIP-1/3 (22), 1.7 A; CCL7/MCP-3 (31), 1.6 A; and CCL2/ 
MCP-1 (27), CCL 1 1/eotaxin (32), and CCL1/I-309 (36), 1.4 A. 
Despite this similarity, there are a few structural differences 
between structures that may be relevant for activity and pro- 
vide some clues on the basis of receptor specificity. 

Among chemokines with a CC motif whose three-dimen- 
sional structures have been solved, CCL20 structure is most 
similar to CCL7 (31), with which it shares the same N-terminal 
orientation. The cores of the CCL20 and CCL7 monomers su- 
perimpose relatively well with the exception of the 30’s loop, 
where remarkable differences are observed. The closed end of 
this loop points toward the helix face in the case of CCL7, 
whereas in CCL20 it lies near the N-terminal loop. This differ- 
ent orientation is due to different packing interactions with the 
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hydrophobic residues following the CC motif ( i.e . Tyr 10 in 
CCL20 and Phe 15 in CCL7). The importance of the orientation 
of this aromatic position, which appears to play a role in con- 
certed motions of the entire 30’s loop and the N terminus (via 
the disulfide bond), has been discussed previously (32). Both 
aromatic residues are relatively solvent-exposed, and the phe- 
nolic OH of Tyr 10 in CCL20 makes a hydrogen bond with the 
amide proton of Asp 50 located at the base of the helix. Phe 15 in 
CCL7 is obviously unable to form such an interaction. This 
hydrogen bond results in a distinct side chain orientation 
which forms very different hydrophobic interactions. Conse- 
quently, Phe 15 (CCL7) and Tyr lo (CCL20) cannot be considered 
to be structurally equivalent units. In addition to this Tyr/Phe 
substitution, there is another structural difference between 
CCL7 and CCL20: the relative position of the C terminus with 
respect to the sheet. This different orientation is due to the 
second Tyr/Phe swap. In CCL7, the aromatic residue in the 
first strand is Tyr 28 , whereas in CCL20, it is Phe 23 . Tyr 28 links 
the sheet to the C-terminal region of the helix by a hydrogen 
bond between its OH and the backbone O of Lys 70, bending the 
structure at the C terminus. The lack of this interaction in 
CCL20 produces a slightly different orientation at the C termi- 
nus. Some other differences between these structures possibly 
related with activity and receptor binding of CCL20 are dis- 
cussed below. 

Implications for Dimerization — Ultracentrifugation analysis 
shows that CCL20 exists in an association, presumably mono- 
mer-dimer, equilibrium. The limitations of homonuclear NMR 
methods make it very difficult to detect intermolecular contacts 
that could account for a multimeric structure. The possible 
intersubunit NOEs may be under strong peaks, below the noise 
level or not detected because of the low population of these 
states. However, additional structural NMR data other than 
NOE, namely the presence of chemical exchange peaks and the 
signal loss because of broadening, clearly prove the presence of 
supramolecular structures. Therefore, if this associated state of 
CCL20 present under our experimental conditions were a 
dimer, we would propose that it is formed by association of the 
first /3-strand (pf) of each monomer to form a six-stranded 
antiparallel sheet. This structure is consistent with the mono- 
mer geometry and all of the NMR evidence in hand and has 
been observed in both CXC and CC chemokines, like CCL7 and 
CCL2 (30, 31, 57). It has been shown that only a few conserved 
residues, characteristic of each chemokine subfamily, are in- 
volved in dimer stabilization (56). In the proposed dimeric 
structure of CCL20, Phe 23 (strand 1), Arg 25 (strand 1), Ser 64 
(helix), and Leu 63 (helix) could account for the quaternary 
contacts. In a similar way, the homologous residues Tyr 28 , 
Arg 30 , Asp 68 , and Leu 67 form the dimer contacts in CCL7 (31). 

Structure-Function ofCCL20 — Several studies have charac- 
terized the interaction between chemokines and cellular recep- 
tors (1, 4, 6). Chemokines bind to negatively charged sequences 
at the N termini of the cellular chemokine receptors and also to 
charged residues on some of the external loops. It is also known 
that amino acid residues preceding the first cysteine and the 
N-loop are important for binding and activating the receptor 
(58). Although chemokines share significant sequence similar- 
ity in regions of common secondary structure, the N terminus 
of each chemokine is largely distinct, even within members of 
the same subfamily (59). However, it is common for chemokine 
sequences to contain an acidic residue among the first residues. 
For example, studies done on the ELR motif near the N termi- 
nus of CXC chemokines (60) suggest that this motif is involved 
in hydrophobic and electrostatic contacts with extracellular 
loops 3 and 4 of the receptor. It is well known that most 
chemokines can bind to more than one receptor in a distinct 
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Fig. 5. Stereoscopic view of the 
groove between the N-loop and the 
/3 2 -/3 3 hairpin. Backbone trace of the su- 
perposition of CCL20 (residues 12-26 and 
37-49; red) and CCL7 (31) (residues 
16-30 and 41-53; blue). a-Carbons are 
labeled in the CCL20 trace. 




and specific manner and that one residue at the N terminus 
( e.g . Asp 6 in CCL5) can be involved in the activation of more 
than one receptor (61). To make these interactions, such key 
residues need to be located in flexible regions that can be 
adjusted to fit the distinct surface topology of different 
receptors. 

The existence of an Asp-Cys-Cys-Leu (DCCL) motif in the 
N-end of CCL20, shared only by human and murine CCL19/ 
ELC, CCL21/SLC, and CCL25/TECK and murine CCL9-10/ 
C 10-like has been reported (8), although the contributions of 
this motif to function are unknown. Recently, human CCL20 
variants that have short deletions in the flexible N termini but 
maintain the DCCL motif and/or the C termini (positions 2—70, 
1-69, 1—67, 1-64, 2-69, 2-67, and 2-64) were all shown to be 
active in intracytoplasmic calcium mobilization as well as che- 
motactic assays (62). Inspection of the CCL20 structure shows 
that residues Asp 5 and Leu 8 have a very well defined confor- 
mation, with both side chains oriented in the same direction 
and largely exposed to the solvent. These characteristics sug- 
gest that these residues are good candidates to make specific 
interactions with the receptor. Asp 5 and Leu 8 flank the CC 
motif held against the sheet by two S^S bonds. Consequently, 
this region should be quite rigid in contrast to the ELR motif 
present in some CXC chemokines, which is located far from 
disulfide bonds. This difference probably has implications for 
receptor specificity. We propose that the rigidity of the DCCL 
motif limits its ability to adopt alternate conformations that 
would allow it to recognize and bind different receptors. Hu- 
man chemokines with a DCCL motif bind to two receptors (one 
specific (CCR7 for CCL19 and CCL21 or CCR9 for CCL25) and 
the another one shared (CCR11)). Therefore, they also bind 
selectively, although not so much as CCL20. 

However, the presence of the DCCL motif in these other 
non-CCR6-binding chemokines strongly suggests that other 
parts of CCL20 play a role in its specific binding to CCR6. In 
other chemokines, such as RANTES (63), multiple structural 
regions are required for specificity. Structures of both CC (54) 
and CXC (59) chemokines complexed with peptide models of 
their receptors show that the peptide receptor mimics bind in a 
groove formed by the N-loop and the 0 2 -0 3 hairpin of the 
chemokines. Recently, a model for chemokine/receptor recogni- 
tion has been proposed based on the binding of residue motifs 
and correct binding to the chemokine groove (59). 

While many of the residues defining the edges of this groove 
channel are charged or polar, the floor of the groove is formed 
predominantly by hydrophobic side chains. Interestingly, re- 
markable differences are observed in the size of the groove; in 
CCL20, it is much more narrow than in CCL7 or CXCL8. Fig. 
5 shows a superposition of CCL20 and CCL7 focusing on this 
region. Both structures match the orientation of the /3 2 -/3 3 
hairpin, but significant discrepancies are observed in the con- 
formation of the N-loops that narrow the groove in CCL20. 



Fig. 6. Ribbon representation of the chemokine CCL20 (left) 
and the 0-defensin BD2 (37) (right). /3-Sheets are represented by 
arrows ; arginine and lysine residues are shown in blue\ Asp 6 in CCL20 
and Asp 4 in BD2 are depicted in red\ and Leu 8 in CCL20 and Leu 9 in 
BD2 are shown in green. 

Although it is difficult to precisely quantify the width of the 
groove, an idea of its dimensions can be obtained by comparing 
the corresponding backbone positions in CCL20, CCL7, and 
CXCL8. Thus, whereas the distance between a-carbons of Lys 42 
(in the middle of the 0 2 -0 3 hairpin) and He 20 (on the top of the 
N-loop in Fig. 5) is about 6 A in CCL20, it is 11 A between 
equivalent positions in CCL7 (Leu^-Arg 25 ) and 9 A in CXCL8 
(Asp 45 -Lys 23 ). Other related distances between both segments 
are always smaller in CCL20 than in the other proteins (e.g. 
Arg 44 — Thr 13 , 11 A in CCL20; Lys 46 -Phe 15 , 18 A in CCL7; and 
Glu^-Phe 17 , 14 A in CXCL8). It is tempting to speculate that 
the narrow channel in CCL20 can only fit a very specific coun- 
terpart and contributes to its highly specific receptor binding. 
We therefore propose that the narrow groove and the rigidity at 
the DCCL motif can account for the singularly specific binding 
between CCL20 and its CCR6 receptor. 

Comparison with Human 0 -Defensins — Recently, human 
0-defensins have been shown to bind to the CCR6 receptor with 
significant affinity (16). Although 0-defensins show no se- 
quence homology with chemokines, it has been postulated that 
they could have similar tertiary CCR6 binding sites and act as 
“microchemokines.” To date, two 0-defensins, BD1 and BD2, 
have been described in humans, and the structure of the latter 
has been solved by crystallography (37). The monomeric struc- 
ture of BD2, stabilized by three disulfide bridges, is composed 
of a three-stranded 0-sheet (residues 14-16, 25-28, and 36-39) 
and a short a-helical region (residues 5—10) held against the 
sheet by a S-S bond. A comparison between this 0-defensin and 
the CCL20 backbone structure is depicted in Fig. 6, showing 
their positively charged side chains, the motif DCCL in CCL20, 
and Asp 4 and Leu 9 in BD2. As can easily be seen, 0-defensin 
can structurally be considered as a simplified form of CCL20, 
with a truncated N-terminal end and lacking the C -terminal 
helix. In the orientation presented in Fig. 6, it is evident that 
the position of the 0-sheets is similar in both molecules, despite 
their different topology (0J up, 0 2 down, 0 3 up, in CCL20, and 
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(3 1 up, /3 2 up, /3 3 down in BD2). Moreover, the positively charged 
side chains (Arg 18 , Arg 25 , Lys 42 , Lys 43 , Arg 44 , and Lys 45 in 
CCL20 and Lys 10 , Arg 22 , Arg 23 , Lys 25 , Lys 36 , Lys 39 , and Lys 40 
in BD2) are distributed in a similar way. Thus, the /3-sheet and 
the specific distribution of charged residues seem to be common 
features in both structures. We have already discussed the 
possible importance of the groove’s width between the N-loop 
and the /3 2 -/3 3 hairpin, and the DCCL motif for CCL20 specific- 
ity. Interestingly, BD2 has two similar residues, Asp 4 and 
Leu 9 , placed in similar position and orientation as those of Asp 5 
and Leu 8 in CCL20. If the DCCL motif plays a role in the 
binding of CCL20 to CCR6, the pair Asp 4 -Leu 9 in BD2 could 
provide similar interactions. These residues are located in a 
regular a-helix structure strongly held against the protein core 
by a disulfide bridge. In addition, these residues are moder- 
ately conserved among /3-defensin sequences (37); this supports 
the idea that specific interactions with CCR6 need appropriate 
groups in a fixed rigid orientation. Whereas a structural ho- 
mology of the previously mentioned CCL20 groove is not obvi- 
ous at first glance in BD2, a closer inspection reveals a region 
that could bind CCR6 in a similar fashion. The CCL20 groove 
is oriented face-on in Fig. 6; therefore, if BD2 binds CCR6 in a 
similar way as CCL20 does, by structural analogy the interact- 
ing region in BD2 would comprise j3 2 (residues 14—16), the loop 
connecting p x and /3 2 (residues 17-24), and residues 36-40. 
Strikingly, sequence alignment of the fragment 14-27 of BD2 
(IHPVFCPRRYKGI) reveals a significant degree of similarity 
with the N-loop region 11-23 of CCL20 (IQTPLPSRAIVGF), and 
the charged end of the sequence of BD2 (36-40, KCCKK) corre- 
sponds to the charged sequence 42-45 (KKRK) in the J3 2 -/3 S 
hairpin of CCL20. These correspondences and the close struc- 
tural similarity may provide BD2 with the necessary groups to 
interact specifically with the CCR6 receptor, although with less 
affinity than CCL20, as has been described (19). 

Clusters of positive charges have been claimed to play a role 
in the activities of both defensins and chemokines (37, 60). We 
have compared the positive charge distribution of BD2 and 
CCL20 with other chemokines, looking for differences. We 
found that BD2 and CCL20 have a highly asymmetrical distri- 
bution of the positive charge (Fig. 6) that is also shared by other 
chemokines examined (not shown). This suggests that these 
charged clusters may play a common role in the activity of all 
of these proteins but are not important for determining their 
receptor binding specificity. 

In summary, we have determined the structure of CCL20, 
the chemokine with the most selective known receptor binding 
properties. Our data suggest that this strict receptor selectivity 
is related to the rigid conformation of the DCCL motif as well 
as the narrow groove formed by the N-loop and the /3 2 -/3 3 
hairpin, compared with other chemokines. Some of these struc- 
tural features are also found in the defensin BD2, thus sup- 
porting their importance for the binding of both CCL20 and 
BD2 to CCR6. The determination of the three-dimensional 
structure of CCL20 will aid the rational design of variants to 
modulate or interfere with its important biological functions. 
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